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Abstract 

We consider the distance from a (square or rectangular) matrix pencil to the nearest 
matrix pencil in 2-norm that has a set of specified eigenvalues. We derive a singular 
value optimization characterization for this problem and illustrate its usefulness for two 
applications. First, the characterization yields a singular value formula for determining 
the nearest pencil whose eigenvalues lie in a specified region in the complex plane. For 
instance, this enables the numerical computation of the nearest stable descriptor system 
in control theory. Second, the characterization partially solves the problem posed in 
[Boutry et al. 2005] regarding the distance from a general rectangular pencil to the near- 
est pencil with a complete set of eigenvalues. The involved singular value optimization 
problems are solved by means of BFGS and Lipschitz-based global optimization algo- 
rithms. 
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1 Introduction 

Consider a matrix pencil A — XB where A,B& c™ xm with n > m. Then a scalar p e C is 
called an eigenvalue of the pencil if there exists a nonzero vector v e C" such that 

(A-pB)v = 0. (1) 

The vector v is said to be a (right) eigenvector associated with p and the pair (p, v) is said 
to be an eigenpair of the pencil. 
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In the square case m = n, the eigenvalues are simply given by the roots of the characteristic 
polynomial det(j4 — XB) and there are usually n eigenvalues, counting multiplicities. The 
situation is quite the opposite for n > m. Generically, a rectangular pencil A — XB has no 
eigenvalues at all. To see this, notice that a necessary condition for the satisfaction of (JlJ is 
that n\j ((n — m)\m\) polynomials, each corresponding to the determinant of a pencil obtained 
by choosing m rows of A — XB out of n rows, must have a common root. Also, the generic 
Kronecker canonical form of a rectangular matrix pencil only consists of singular blocks (see 
[9]). Hence, is an ill-posed problem and requires reformulation before admitting numerical 
treatment. 

To motivate our reformulation of ([!]), we describe a typical situation giving rise to rect- 
angular matrix pencils. Let M £ <C nxn and suppose that the columns of U € c™ x ™ form 
an orthonormal basis for a subspace W C C n known to contain approximations to some 
eigenvectors of M . Then it is quite natural to consider the n x m matrix pencil 

A — XB := MU - XU. (2) 

The approximations contained in W and the approximate eigenpairs of A — XB are closely 
connected to each other. In one direction, suppose that (p, x) with i£W satisfies 

(Af + AM - pl)x = (3) 

for some (small) perturbation AM . Then there is v € C™ such that x = Uv. Moreover, we 
have 

(A + AA - P B)v = (4) 

with A A := AM ■ U satisfying ||AA|| 2 < \\AM || 2 . In the other direction, the relation Q with 
an arbitrary A A implies Q with AM = A A ■ U* satisfying ||AM|| 2 = ||A^4|| 2 . Unless M is 
normal, the first part of this equivalence between approximate eigenpairs of M and A — XB 
does not hold when the latter is replaced by the more common compression U*MU. This 
observation has led to the use of rectangular matrix pencils in, e.g., large-scale pseudospectra 
computation (see [3T]) and Ritz vector extraction (see [T7]). 

This paper is concerned with determining the 2-norm distance from the pencil A — XB to 
the nearest pencil (A + AA) — XB with a subset of specified eigenvalues. To be precise, let 
§ = {Ai, . . . , Afc} be a set of distinct complex numbers and let r be a positive integer. Let 
nij(A + AA, B) denote the (possibly zero) algebraic multiplicity of Xj as an eigenvalue of 
(A + AA) — XB. Then we consider the distance 

fc 

7v(§) :=inf |||AA|| 2 :^m 3 {A + AA, B) > r}. (5) 

3=1 

We allow B to be rank-deficient. However, we require that rank(£>) > r. Otherwise, if 
rank(B) < r, the pencil (A + AA) — XB has fewer than r finite eigenvalues for all AA and 
consequently the distance r r (S) is ill-posed. 



1 For a rectangular matrix pencil, the algebraic multiplicity of \j is denned as the sum of the sizes of 
associated regular Jordan blocks in the Kronecker canonical form, see also Section [2] By definition, this 
number is zero if \j is actually not an eigenvalue of the pencil. 
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For k = r = 



1, it is relatively easy to see that 



ri({Xi}) 



a m (A-XiB) 



where, here and in the following, er^ denotes the fcth largest singular value of a matrix. 
(The particular form of this problem with k = r = 1, and when A and B are perturbed 
simultaneously, is also studied for instance in [7].) One of the main contributions of this 
paper is a derivation of a similar singular value optimization characterization for general k 
and r, which facilitates the computation of r r (S). Very little seems to be known in this 
direction. Existing results concern the square matrix case (m = n and B = I); see the works 
by [21] for k = 1 and r = 2 as well as [Hj for k = 2 and r = 2. [T5] for k = 1 and r = 3. 
and [25] for fc = 1 and arbitrary r. Some attempts have also been made by [22] for arbitrary 
k and r and for the square matrix case, and by [27] for k = 1 and r = 2 and for the square 
matrix polynomial case. 

Another class of applications arises in (robust) control theory, where a number of tasks 
require the determination of a (minimal) perturbation that moves some or all eigenvalues into 
a certain region in the complex plane. With the region of interest denoted by C C, the 
results in this paper are an important step towards rendering the numerical computation of 
the distance 



feasible. Here and in the following, multiple eigenvalues are counted according to their alge- 
braic multiplicities. For r = 1 and f2 equal to C + (right-half complex plane), the quantity 
Ti(C + ) amounts to the distance to instability, also called stability radius. In [35], a singular 
value characterization of ti(C + ) was provided, forming the basis of a number of algorithms 
for computing ri(C + ), see, e.g., [3J |6]. In our more general setting, we can also address the 
converse question: Given an unstable matrix pencil A — AB, determine the closest stable 
pencil. Notice that this problem is intrinsically harder than the distance to instability. For 
the distance to instability it suffices to perturb the system so that one of the eigenvalues is 
in the undesired region. On the other hand to make an unstable system stable one needs to 
perturb the system so that all eigenvalues lie in the region of stability. 
An important special case, 17 = C leads to 



For r = 1 and particular choices of rectangular A and B, the distance Ti(C) corresponds to 
the distance to uncontrollability for a matrix pair (see [5j HQ]). For general r, a variant of 
this distance was suggested in [2] to solve an inverse signal processing problem approximately. 
More specifically, this problem is concerned with the identification of the shape of a region 
in the complex plane given the moments over the region. If the region is assumed to be a 
polygon, then its vertices can be posed as the eigenvalues of a rectangular pencil A—XB, where 
A and B are not exact due to measurement errors, causing the pencil to have no eigenvalues 
(see [11] for details). Then the authors attempt to locate nearby pencils with a complete set 



r r (fi) 




TV-(C) 



inf{|| AA|| 2 : {A + AA) - XB has r finite eigenvalues } 



inf tv(§). 

sec 



Generalized Eigenvalue Problems with Specified Eigenvalues 



4 



of eigenvalues. In this work we allow perturbations to A only, but not to B. This restriction 
is only justified if the absolute value of A does not become too small. We consider our results 
and technique as significant steps towards the complete solution of the problem posed in [11] . 

The outline of this paper is as follows. In the next section, we review the Kronecker canon- 
ical form for the pencil A — XB. In |3j we derive a rank characterization for the condition 
Y]j—i mj(A,B) > r. This is a crucial prerequisite for deriving the singular value character- 
izations of r r (S) in Sj4] We discuss several corollaries of the singular value characterizations 
for T r (§), in particular for T r (Sl) and r r (C), in Sj5j The singular value characterizations are 
deduced under certain mild multiplicity and linear independence assumptions. Although we 
expect these assumptions to be satisfied for examples of practical interest, they may fail to 
hold as demonstrated by an academic example in f|6] Interestingly, the singular value charac- 
terization remains true for this example despite the fact that our derivation no longer applies. 
Finally, a numerical approach to solving the involved singular value optimization problems 
is briefly outlined in ^7] and applied to a number of settings in The main point of the 
developed numerical method and the experiments is to demonstrate that the singular value 
characterizations facilitate the computation of r r (S), r r (f2) and r r (C). We do not claim that 
the method outlined here is as efficient as it could be, neither do we claim that it is reliable. 



2 Kronecker Canonical Form 

Given a matrix pencil A — XB E C nXm , the Kronecker canonical form (KCF), see [13], states 
the existence of invertible matrices P G c™ x « and Q € c mx ™ such that the transformed 
pencil P(A — XB)Q is block diagonal with each diagonal block taking the form 

J p (a) — XI P or I p — AJp(O) or F p — XG P or — AGj, 

where 



J p (a) 



a 1 

o 



1 

a 



F„ = 



1 



1 



i G v — 



1 



1 



(6) 



px(p+l) 



px(p+l) 



for some a G C. Regular blocks take the form J p {a) ~ XI p or I p — AJ P (0), with p > 1, 
corresponding to finite or infinite eigenvalues, respectively. The blocks F p — XG p and F^—XGp 
are called right and left singular blocks, respectively, with p > corresponding to a so called 
Kronecker index. 

In large parts of this paper, indeed until the main singular value optimization characteri- 
zation, we will assume that A — XB has no right singular blocks F p — XG p . Eventually, we will 
remove this assumption by treating the occurence of such blocks separately in Section |4.3| 



Generalized Eigenvalue Problems with Specified Eigenvalues 



5 



3 Rank Characterization for Pencils with Specified Eigen- 
values 

In this section we derive a rank characterization for the satisfaction of the condition 

k 

J2mj(A,B)>r, (7) 

where rrij (A, B) denotes the algebraic multiplicity of the eigenvalue Xj . The following classical 
result [T31 Theorem 1, p. 219] concerning the dimension of the solution space for a Sylvester 
equation will play a central role. 

Theorem 3.1. Let F E C mxm an d Q £ C' xr . Then the dimension of the solution space for 
the Sylvester equation 

FX — XG = 

only depends on the Jordan canonical forms of the matrices F and G. Specifically, suppose 
that Hi, . . . , He are the common eigenvalues of F and G. Let Cj t i, . . . , Cj^ and Pj t i, ■ ■ ■ ,Pj g. 
denote the sizes of the Jordan blocks of F andG associated with the eigenvalue Hj, respectively. 
Then 

l lj h 

dim{X G C mxr : FX - XG = 0} = J2J2J2 mhx ( c j,i>Pi,q)- 

j=l i=l q=l 

For our purposes, we need to extend the result of Theorem |3.1| to a generalized Sylvester 
equation of the form 

AX - BXC = 0, (8) 

where C is a matrix with the desired set of eigenvalues S and with correct algebraic multi- 
plicities. For this type of generalized Sylvester equation, the extension is straightforward^ 
To see this, let us partition the KCF 

P(A - XB)Q = diag (A F - XI, I - XA X , A s - XB S ) , (9) 

such that 

• Ap — XI contains all regular blocks corresponding to finite eigenvalues; 

• / — XAi contains all regular blocks corresponding to infinite eigenvalues; 

• As — XBs contains all left singular blocks of the form F^ — XGp. 

As explained in Section [2j we exclude the occurence of right singular blocks for the moment. 
Note that the finite eigenvalues of A — XB are equal to the eigenvalues of Ap with the same 
algebraic and geometric multiplicities. 

Using {9), X is a solution of the generalized Sylvester equation ^ if and only if 

(PAQ){Q- 1 X)-{PBQ)(Q- 1 X)C = <^> diag (Ap, I, As) Y — diag (/, A%, B$) YC = 

2 119| provides an extension of Theorem 3.1 to a more general setting. 
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where Y = Q 1 X. Consequently, the dimension of the solution space for Q is the sum of 
the solution space dimensions of the equations 

A F Y 1 - Y X C = and Y 2 - A r Y 2 C = and A S Y 3 - B S Y 3 C = 0. 

Results by [9] show that the last two equations only admit the trivial solutions Y% = and 
Y 3 = 0. To summarize: the solution spaces of the generalized Sylvester equation ^ and the 
(standard) Sylvester equation 

A F X - XC = 



have the same dimension. Applying Theorem 3.1 we therefore obtain the following result. 



Theorem 3.2. Let A, B £ C nxm with n > m be such that the KCF of A - \B does not 
contain right singular blocks. Then the dimension of the solution space for the generalized 
Sylvester equation 

AX - BXC = 

only depends on the Kronecker canonical form of A — XB and the Jordan canonical form of 
C G C rxr . Specifically suppose that fix, . . . , \ig are the common eigenvalues of A — XB and 
C . Let Cj i, . . . , Cj,i j and Pj,i, . . . denote the sizes of the Jordan blocks of A — XB and C 

associated with the eigenvalue fj,j, respectively. Then 



dim{X e C mxr : AX - BXC = 0} =SS£min(c f ,i ) pJ, 

j=l i=l q=l 

We now apply the result of Theorem |3.2| to the generalized Sylvester equation 

AX — BXCiji, T) = 0, 

where C (p, T) takes the form 



(10) 



o 



-721 

/'2 



"Trl 



"7r,r-l 
//.,. 



(11) 



[ Ml 



fr] T eS r , r=[ 721 



ith 

731 ■ • ■ lr,r-l \ € L v ;/ . 

As explained in the introduction, the set § = {Ai, . . . , A^} contains the desired approximate 
eigenvalues. Suppose that Xj occurs pj times in p.. Furthermore, as in Theorem 3.2 denote the 
sizes of the Jordan blocks of A — XB and C(/x, T) associated with the scalar Xj by Cj ; i, . . . , Cj^. 

and Pj,x, ■ ■ ■ iVjl-i respectively. Note that pj = Y^q=i Pj.q- I n fa-ct, for generic values of Y the 
matrix C(/i,Y) has at most one Jordan block of size pj associated with Xj for j = 1, . . . , k, 
see 0- In the following, we denote this set of generic values for Y by Q{p). By definition, this 
set depends on fi but not on A — XB. 
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First, suppose that inequality (|7|) holds. If we choose \i such that J2j=iPj 
Pj < mj(A.B) = ~Y^i=\ c j,ii then Theorem 3.2 



and 



implies that the dimension of the solution 



space for the generalized Sylvester equation ( 10 1 is 

fc tj lj fc lj fc fc 

E E E ™ n ( c j,i::Pj,<z) -EE min ( c j,i>Pj) ^ E min(mj(A, i?) , Pj ) = = r. 

j=l i=l q=l j=l i=l j=l j=l 

In other words, there exists a vector fi with components from S such that the dimension of 
the solution space of the Sylvester equation ( |To| ) is at least r. 

Now, on the contrary, suppose that inequality ([7| does not hold. Then for generic values 
r G G(fi), the solution space dimension of (10) is 



E E min ( c 

j=x i=i 



< 



fc ti 

EE' 

j=i »=i 



j=i 



In other words, no matter how /i is formed from §, the dimension is always less than r for 
r € This shows the following result. 

Theorem 3.3. Let A, B G C nxm with n > m be such that the KCF of A - XB does not 
contain right singular blocks. Consider a set S = {Ai, . . . , A&} of distinct complex scalars, and 
a positive integer r. Then the following two statements are equivalent. 

(1) '^2^ = i'rnj(A, B) > r, where mj(A,B) is the algebraic multiplicity of \j as an eigenvalue 
ofA-XB. 



(2) There exists [i € § r such that 

dim{Jf G 



AX - BXC{p,T) = 0} > r 



for all r G G{fJ>)> where C(/x, T) is defined as in (11). 



To obtain a matrix formulation of Theorem |3.3| we use the Kronecker product ® to vectorize 
the generalized Sylvester equation (flOl) and obtain 



(((7 ® A) - (C T 0, T) <g) B)) vec(X) = £(/*, T, A, B)vec(X) = 0, 
with the lower block triangular matrix 

A- HiB 



£{H,T,A,B) 



721-B A - [i 2 B 



IrlB 



A — /i r _i_B 

7 r r _lB A — fl r B 



(12) 



The operator vec stacks the columns of a matrix into one long vector. Clearly, the solution 
space of the generalized Sylvester equation and the null space of C(fi, T, A, B) have the same 
dimension. Consequently, Theorem |3.3| can be rephrased as follows. 
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Corollary 3.4. Under the assumptions of Theorem 3.3. the following two statements are 
equivalent. 

(1) £j=i"b-(4B)>r. 

(2) There exists fi G S r such that rank (C(fJ>, T, A, B)) < mr — r for all T G G(p)- 



4 A singular value characterization for the nearest pencil 
with specified eigenvalues 

As before, let § = {Ai, . . . , A^} be a set of distinct complex scalars and let r be a positive 
integer. The purpose of this section is to derive a singular value optimization characterization 
for the distance t>(S) defined in ([5]). Our technique is highly inspired by the techniques in 
25, 26] and in fact the main result of this section generalizes the singular value optimization 
characterizations from these works. We start by applying the following elementary result |14[ 
Theorem 2.5.3, p. 72] to the rank characterization derived in the previous section. 

Lemma 4.1. Consider C G C q anc [ a positive integer p < mm(£,q). Then 
inf {||AC|| a : rank(C + AC) < p} = a p+1 (C). 

Defining 

V r (ji) :=inf {||AA|| 2 : rank (£(/i, T,A + AA, B)) < mr - r) (13) 
for some V G G(p), Corollary |3.4| implies 

7V (S) = inf V r (ji), 

independent of the choice of T. By Lemma [4~T| it holds that 

V r (ji) = inf{||AA|| 2 : rank (£(ji,T, A + AA, B)) < mr - r} 
> <T mr - r+ i(C(p,T,A,B)), 

using the fact that A enters C linearly Note that this inequality in general is not an equality 
due to the fact that the allowable perturbations to C(fi, T, A, B) in the definition of 7-V(/i) 
are not arbitrary. On the other hand, the inequality holds for all T G G(fJ-) and hence - by 
continuity of the singular value ow_ r+ i(-) with respect to T - we obtain the lower bound 

V r {p) > sup o- mr - r+ i (C(n,T , A, B)) =: K r (fi). (14) 

For m = n, it can be shown that ow-r+i T, A, B)) tends to zero as ||r|| := £ |7y| ~^ 00 
provided that rank(S) > r; see Appendix [A] for details. From this fact and the continuity of 
singular values, it follows that the supremum is attained at some in the square case: 

K r (p) = a mr - r +i (C{fi,T^,A,B)) . 

In the rectangular case, numerical experiments indicate that the supremum is still attained 
if rank(i?) > r, but a formal proof does not appear to be easy. Moreover, it is not clear 
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whether the supremum is attained at a unique or not. However, as we will show in the 
subsequent two subsections, any local extremum of the singular value function is a global 
maximizer under mild assumptions. (To be precise, the satisfaction of the multiplicity and 
linear independence qualifications at a local extremum guarantees that the local extremum is 
a global maximizer; see Definitions |4.2| and |4.3| below for multiplicity and linear independence 
qualifications.) 

Throughout the rest of this section we assume that the supremum is attained at some F* 



and that £ Q{^)- The latter assumption will be removed later, in Section 4.3 

We will establish the reverse inequality V r {\i) < K r (fx) by constructing an optimal pertur- 
bation AA* such that 

(i) || AA^ || 2 = «v(m)j an -d 

(ii) rank(£(/z,r*,A + AA*,B)) < mr - r. 

Let us consider the left and right singular vectors U £ C rn and V £ C™ satisfying the 
relations 



C(li,T*,A,B)V = K r ( f i)U, U* £(//, T*, A, B) = V* K r (fx), \\U\\ 2 = \\V\\ 2 = 1. (15) 

The aim of the next two subsections is to show that the perturbation 

AA := -K r (fj,)UV + (16) 

with U £ C nxr and V £ C mxr such that vec(W) = U and vec(V) = V satisfies properties (i) 
and (ii). Here, V + denotes the Moore-Penrose pseudoinverse of V. The optimality of A A 
will be established under the following additional assumptions. 

Definition 4.2 (Multiplicity Qualification). We say that the multiplicity qualification holds 
at (n, r) for the pencil A — XB if the multiplicity of the singular value cr mr _ r+ i (>C(/i, T, A, B)) 
is one. 

Definition 4.3 (Linear Independence Qualification). We say that the linear independence 
qualification holds at (/!, L) for the pencil A—XB if there is a right singular vector V associated 
with cr mr _ r+ i (£(/i, r, A, B)) such that V £ C mxr ; with vec(V) = V, has full column rank. 



4.1 The 2-norm of the optimal perturbation 

Throughout this section we assume that the multiplicity qualification holds at the optimal 
(//, r*) for the pencil A — XB. Moreover, we can restrict ourselves to the case K r (n) ^ 0, as 
the optimal perturbation is trivially given by A A* — when K r (/i) = . 

Let ^4(7) be a matrix-valued function depending analytically on a parameter 7 £ I. If 
the multiplicity of crj (.4(7*)) is one and Oj (^(7*)) 7^ 0, then Oj (^(7)) is analytic at 7 = 7*, 
with the derivative 

™-B.(«S^), (.7) 

where uj and Vj denote a consistent pair of unit left and right singular vectors associated with 
<7j (.4(7*)), see, e.g., @J MM- 
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Let us now define 

f(T) :=a nr - r+1 (£(»,T,A,B)), 

where we view / as a mapping ]R''( r_1 ) — ^ R by decomposing each complex parameter 7^ 
contained in V into its real and imaginary parts Sfry^ and Qjji- By (17), we have 

- MU-BV,), »Sg - Re(i UjBVi) - -Im^TO), 

where J7j 6 C™ and G C m denote the jth and £th block components of [/ and V, respec- 
tively. Furthermore, the fact that is a global maximizer of / implies that both derivatives 
are zero. Consequently we obtain the following result. 

Lemma 4.4. Suppose that the multiplicity qualification holds at (/i, T*) for the pencil A — XB 
and K r (ii) y^= 0. Then U*BVg = for all j = 2, . . . , r and I = 1, . . . , j — 1. 

Now by exploiting Lemma |4.4| we show U*U = V*V. Geometrically this means that the 
angle between Ui and Uj is identical with the angle between Vi and Vj. 



Lemma 4.5. Under the assumptions of Lemma 4-4 H holds thatlA*U = V*V. 



Proof. Expressing the first two equalities in the singular value characterization ( 15 1 in matrix 
form yields the generalized Sylvester equations 

AV - BVC(p,T,) = Kr (lJ,)U 

and 

U*A-C{ii,T*)U*B = «v(»V*. 

By multiplying the first equation with U* from the left-hand side, multiplying the second 
equation with V from the right-hand side, and then subtracting the second equation from the 
first we obtain 

K r (fx) (U*U - V* V) = C(p, T*)U*BV - U*BVC(ji, r*). (18) 



Lemma 4.4 implies that IA*BV is upper triangular. Since C(/j,, r„) is also upper triangular, 
the right-hand side in ( 18 1 is strictly upper triangular. But the left-hand side in ( 18 1 is 
Hermitian, implying that the right-hand side is indeed zero, which - together with K r ([i) 7^ 
- completes the proof. □ 



The result of Lemma 4.5 implies ||WV + ||2 = 1. A formal proof of this implication can 
be found in [24., Lemma 2] and [25j Theorem 2.5]. Indeed, the equality ||WV + ||2 = 1 can 
be directly deduced from ||WV + £||2 = ||VV + x||2 for every x (implied by Lemma 4.5), and 
||V' 1 ^ + ||2 = 1 (since VV + is an orthogonal projector). 

Theorem 4.6. Suppose that the multiplicity qualification holds at (//,r t ) for the pencil A — 
XB. Then the perturbation A A* defined in (16) satisfies \\AA*\\2 = n r (p). 
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4.2 Satisfaction of the rank condition by the optimally perturbed 
pencil 



Now we assume that the linear independence qualification (Definition 4.3) holds at (fx, r*) for 
the pencil A — XB. In particular we assume we can choose a right singular "vector" vec (V) 
so that V has full column rank. We will establish that 

rank(£(/x, T,, A + AA*,B)) <mr-r (19) 



for A A* defined as in (16). 

Writing the first part of the singular vector characterization ( 15 ) in matrix form leads to 
the generalized Sylvester equation 

AV - BVC(fj,,T*) = n r {p)U. 

The fact that V has full column rank implies V + V = I and hence 

AV - BVC(fj,, r*) = K r (n)UV+V 
=> (A — K r (/i)UV + )V — BVC(n, vj) = 
=> {A + AaJ)V-BVC(h,T,) = 0. 

Let us consider M = {D E C rxr : C^^Y^D - DC[n,T t ) = 0}, the subspace of all r xr 
matrices commuting with C(/i, T*). By Theorem |3.1| Ai is a subspace of dimension at least r. 
Clearly for all D 6 A4, we have 

= {A + AAJVD - BVC((i, r t )D = (A + AA*)(VD) - B(VD)C{n, r»). 

In other words, {VD : D E A4} has dimension at least r (using the fact that V has full column 
rank) and represents a subspace of solutions to the generalized Sylvester equation 

(A + AA, )X - BXC(n, r*) = 0. 



Reinterpreting this result in terms of the matrix representation, the desired rank estimate ( 19 ) 
follows. This completes the derivation of V r {ii) < K r {n) under the stated multiplicity and 
linear independence assumptions. 

4.3 Main Result 

To summarize the discussion above, we have obtained the singular value characterization 

r r (§) = inf supow_ r+ i (£(n,T,A,B)) . (20) 

fJ,€S r p 

Among our assumptions, we have 

(i) the KCF of A - XB has no right singular blocks and (ii) E G(n)- (21) 

In this section, we show that these two assumptions can be dropped. We still require that 
rank(S) > r. As explained in the introduction, the distance problem becomes ill-posed 
otherwise. 
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(i) Suppose that the KCF of A- XB contains a right singular block E p - XG P G R px for 
some p > 0. By Sec. 4], the generalized Sylvester equation F p Y — G p XC(p, T) = 
has a solution space of dimension r. This implies that also the solution space of AX — 
BXC{p,T) = has dimension at least r, and consequently cr„ lr _ r+ i (£(p,T,A,B)) is 
always zero. On the other hand, the presence of a right singular block implies that for 
any e > and pi, . . . , p r € C with r < rank(_B) there is a perturbation A A such that 
||AA||2 < £ and (A + AA) — XB has eigenvalues /ii, . . . ,p r , see [5]. This shows r r (§) = 



and hence both sides of ( 20 1 are equal to zero 



In summary, we can replace the assumption (i) by the weaker assumption rank(_B) > r. 



(ii) To address (ii), we first note that both V r (p) and n r {p), defined in ( |13| and (14 1, 
change continuously with respect to p. Suppose that /x has repeating elements, which 
allows for the possibilitiy that ^ Q{p)- But for all /x with distinct elements, we 
necessarily have Q{p) = C r ( r - 1 )/ 2 . Moreover, when jl is sufficiently close to \i then 
V r (Ji) = K r (p), provided that the multiplicity and linear independence assumptions 
hold at (pyTi,,) (implying the satisfaction of these two assumptions for p also). Then 
the equality V r (p) = n r (p) follows from continuity. Consequently, the assumption (ii) 



in (21) is also not needed for the singular value characterization. 



We conclude this section by stating the main result of this paper. 

Theorem 4.7 (Nearest Pencils with Specified Eigenvalues). Let A — XB be an n x m pencil 
with n > m, let r be a positive integer such that r < rank(_B) and let S = {Ai, . . . , A^} be a 
set of distinct complex scalars. 



(i) The 



r r (S) = inf sup<T mr _ r+ i (£(fi,T,A,B)) 



holds, provided that the optimization problem on the right is attained at some (/it*, r») for 
which is finite and the multiplicity as well as the linear independence qualifications 
hold. 



(ii) A minimal perturbation AA* such that^2^ =1 m(A + AA*, B) > r is given by [16), with 



p replaced by p* . 



5 Corollaries of Theorem 14.7 



As discussed in the introduction one potential application of Theorem 4.7 is in control theory, 
to ensure that the eigenvalues lie in a particular region in the complex plane. Thus let f2 be 
a subset of the complex plane. Then, provided that the assumptions of Theorem |4 . 7| hold , we 
have the following singular value characterization for the distance to the nearest pencil with 
r eigenvalues in Q: 



rJfl) := inf tJS) 

SCO 



; inf inf sup c mr _ r+ i (C (p, T, A, B) 

SCO ^tCS r p 



(22) 



= inf sup er T( 
/igO 7 ' p 



r+ i(£ (p,T,A,B) 
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where fl r denotes the set of vectors of length r with all entries in f2. 

When the pencil A — XB is rectangular, that is n > to, the pencil has generically no 
eigenvalues. Then the distance to the nearest rectangular pencil with r eigenvalues is of 
interest. In this case, the singular value characterization takes the following form: 

T r (C) = inf SUp Umr-r+1 (£ (a*j r, A, B)) . (23) 

M6C r r 

The optimal perturbations A A* such that the pencil (A + AA*) — XB has eigenvalues (in 



C and Q) are given by (16), with [i replaced by the minimizing \x values in (23) and ( |22 
respectively. 



6 Multiplicity and linear independence qualifications 



The results in this paper are proved under the assumptions of multiplicity and linear indepen- 
dence qualifications. This section provides an example for which the multiplicity and linear 
independence qualifications are not satisfied for the optimal value of T. Note that this does 
not mean that these assumptions are necessary to prove the results from this paper. In fact, 
numerical experiments suggest that our results may hold even if these assumptions are not 
satisfied. 

Consider the pencil 



" -1 





" 




"0 





0" 





5 





- A 





1 











2 










1 



Let (i — [5 l] , that is, the target eigenvalues are 5 and 1. Then it is easy to see that the 
optimal perturbation is given by 



AA* = 





0-1 



The singular values of the matrix £([i, 7, A, B) are 



0, 1, VI6+I7I 2 , y5+i| 7 | 2 ±^l7l 4 + 20| 7 | 2 +64 
where the multiplicity of the singular value 1 is two. Hence 

CT5 (£(/z, 7, A, B)) — y 5 +i| 7 | 2 -^|7l 4 + 20| 7 | s 



■20|7| 2 + 64. 

Clearly the supremum is attained for 7 = and 175 (£([1, 0, A, B)) = 1. Hence the multiplicity 
condition at the optimal 7 is violated. All three pairs of singular vectors corresponding to 
the singular value 1 at the optimal 7 violate the linear independence condition, but one pair 
does lead to the optimal perturbation AA* . 
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7 Computational issues 

A numerical technique that can be used to compute r r (0) and r r (C) based on the singular 
value characterizations was already described in |25[ 126] . For completeness, we briefly recall 
this technique in the following. The distances of interest can be characterized as 

T r (f2) = inf g(n) and i~ r (C) = inf g(n), 

where g : C r — > M is defined by 

g(n) := sup cr mr - r+ i(C(fi,r, A, B)) . 

recc*- 1 )/ 2 

The inner maximization problems are solved by BFGS, even though cr mr _ r+ i(-) is not 
differentiable at multiple singular values. In practice this is not a major issue for BFGS as 
long as a proper line search (e.g., a line search respecting weak Wolfe conditions) is used, as 
the multiplicity of the rth smallest singular value is one generically with respect to T for any 
given /i; see the discussions in 20J. If the multiplicity and linear independence qualifications 
hold at a local maximizer T*, then is in fact a global maximizer and hence g(fj-) is retrieved. 
If, on the other hand, BFGS converges to a point where one of these qualifications is violated, 
it needs to be restarted with a different initial guess. In practice we have almost always 
observed convergence to a global maximizer immediately, without the need for such a restart. 

Although the function g{p) is in general non-convex, it is Lipschitz continuous: 

\g{n + 6n)-g{n)\ < HMb • \\B\\ 2 . 

There are various Lipschitz-based global optimization algorithms in the literature stemming 
mainly from ideas due to Piyavskii and Shubert (see 28, 30]). The Piyavskii-Shubert algo- 
rithm is based on the idea of constructing a piecewise linear approximation lying beneath 
the Lipschitz function. We used DIRECT (see [TS] ) , a sophisticated variant of the Piyavskii- 
Shubert algorithm. DIRECT attempts to estimate the Lipschitz constant locally, which can 
possibly speed up convergence. 

The main computational cost involved in the numerical optimization of singular values 
is the retrieval of the rth smallest singular value of £(//, I\ A, B) at various values of n and 
r. As we only experimented with small pencils, we used direct solvers for this purpose. For 
medium to large scale pencils, iterative algorithms such as the Lanczos method (see [2]) are 
more appropriate. 

8 Numerical Experiments 

Our algorithm is implemented in Fortran, calling routines from LAPACK for singular value 
computations, the limited memory BFGS routine written by J. Nocedal (discussed in I23j ) for 
inner maximization problems, and an implementation of the DIRECT algorithm by Gabion- 
sky (described in |12| ) for outer Lipschitz-based minimization. A mex interface provides 
convenient access via Matlab. 

The current implementation is not very reliable, which appears to be related to the nu- 
merical solution of the outer Lipschitz minimization problem, in particular the DIRECT 
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algorithm and its termination criteria. We rarely obtain results that are less accurate than 
the prescribed accuracy. The multiplicity and linear independence qualifications usually hold 
in practice and don't appear to affect the numerical accuracy. For the moment, the imple- 
mentation is intended for small pencils {e.g., n,m < 100). 



8.1 Nearest Pencils with Multiple Eigenvalues 



As a corollary of Theorem |4.7| it follows that for a square pencil A 
having S = {/i} as a multiple eigenvalue is given by 



XB the nearest pencil 



T 2 (S) = SUp 



A — fiB 
jB A - fiB 



provided that the multiplicity and linear independence qualifications are satisfied at the op- 
timal (fi, 7*). Therefore, for the distance from A — XB to the nearest square pencil with a 
multiple eigenvalue the singular value characterization takes the form 



inf SUpt7 2 n-l 

Specifically, we consider the pencil 



A- [iB 
jB A- [iB 



A-XB = 



2 -1 
-1 2 
-1 -1 



-1 
-1 
2 



- A 



-1 2 
2 -1 
4 2 



(24) 



(25) 



Solving the above singular value optimization problem results in a distance of 0.59299 to the 



nearest pencil with a multiple eigenvalue. By (16 1, a nearest pencil turns out to be 



1.91465 -0.57896 -1.21173 
-1.32160 1.93256 -0.57897 
-0.72082 -1.32160 1.91466 



- A 



-1 2 

2 -1 
4 2 



with the double eigenvalue A* = —0.85488. The optimal maximizing 7 turns out to be zero, 
which means neither the multiplicity nor the linear independence qualifications hold. (This is 
the non-generic case; had we attempted to calculate the distance to the nearest pencil with /1 
as a multiple eigenvalue for a given fi, optimal 7 appears to be non-zero for generic values of 



/i.) Nevertheless, the singular value characterization (24 1 remains to be true for the distance 
as discussed next. 

The e-pseudospectrum of A — XB (subject to perturbations in A only) is the set A t (A, B) 

< e. Equivalently, 



containing the eigenvalues of all pencils (^4 

A e (A,B) = {Xe 



A A) - XB such that ||AA|| 2 
: a min (A-XB) < e}. 



It is well known that the smallest e such that two components of A e (A,B) coalesce equals 
the distance to the nearest pencil with multiple eigenvalues. (See Q] for the case B = I, but 
the result easily extends to arbitrary invertible B.) Figure [l] displays the pseudospectra of 
the pencil in ( 25 ) for various levels of e. Indeed, two components of the e-pseudospectrum 



coalesce for e = 0.59299, confirming our result. 



Generalized Eigenvalue Problems with Specified Eigenvalues 



16 




-1.4 -1.2 -1 -0.8 -0.6 -0.4 -0.2 0.2 



Figure 1: Pseudospectra for the pencil in (25), with eigenvalues marked by the black crosses. 
Two components of the e-pseudospectrum coalesce for e = 0.59299, corresponding to the 
distance to a nearest pencil with a multiple eigenvalue A* = —0.85488 at the coalescence 
point (marked by the asterisk). 
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8.2 Nearest Rectangular Pencils with at least Two Eigenvalues 

As an example for a rectangular pencil, let us consider the 4x3 pencil 



A-\B = 



The KCF of this pencil contains a 4 x 3 singular block and therefore the pencil has no 
eigenvalues. However, if the entry 022 is set to zero, the KCF of the resulting pencil contains 
a 2 x 1 singular block and a 2 x 2 regular block corresponding to finite eigenvalues. Hence, a 
perturbation with 2- norm 0.1 is sufficient to have two eigenvalues. 

According to the corollaries in Section [5] the distance to the nearest 4x3 pencil with at 
least two eigenvalues has the characterization 



1 





" 




" 





" 





0.1 





- A 


1 











2 


0.3 





1 








1 


2 










1 



T 2 (C) = inf SUp(72m-l 

pec 2 7 



A - 
1 B A- ii 2 B 



(26) 



--■■g{n) 



for to — 3. Our implementation returns 72(C) = 0.03927. The corresponding nearest pen- 



cil ( 16 ) is given by 



0.99847 


-0.03697 


-0.01283 " 




" 





" 





0.08698 


0.03689 


-A 


1 











2.00172 


0.30078 





1 





0.00007 


1.00095 


2.00376 










1 



and has eigenvalues at [i\ — 2.55144 and fi 2 — 1.45405. This result is confirmed by Figure [2] 
which illustrates the level sets of the function g(jjL) defined in (26) over M 2 . 



For this example the optimal 7 is 2.0086. The smallest three singular values of the matrix in 
( 26 1 are 1.4832, 0.0393 and 0.0062 for these optimal values of /i and 7. The linear independence 



qualification also holds. 



8.3 Nearest Stable Pencils 



As a last example, suppose that Bx'(t) = Ax(t) with A,Be C rix ™ i s arL unstable descriptor 
system. The distance to a nearest stable descriptor system is a special case of r„(£l), with 
= C _ , the open left-half of the complex plane. A singular value characterization is given 

by 



") 



inf 



sup er n 2_ n+1 



A - XiB 
l21 B A - X 2 B 



Specifically, we consider a system with B = I 2 and 



.4 



0.6 



0.2+ p 

0.1+~§i 0.5 + |i 






A~ \„B 



\ 



(27) 
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Figure 2: Level sets over M 2 of the function g(fi) denned in (26). The asterisk marks the 
numerically computed global minimizer of g, which corresponds to the eigenvalues of a nearest 
pencil with two eigenvalues. 
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Figure 3: Pseudo-spectra of the matrix A denned in (27). The outer orange curve represents 
the boundary of the e-pseudospectrum for e = 0.6610, the distance to a nearest stable matrix. 



Both eigenvalues Ai = 0.7 — i and A2 = 0.4 + i are in the right-half plane. Based on the 
singular value characterization, we have computed the distance to a nearest stable system 
x'(t) = (A + AA*)x(i) as 0.6610. The corresponding perturbed matrix 



A + Aj4* = 



0.0681 - 0.3064i -0.4629 + 1.2524i 
0.2047 + 0.5858i -0.1573 + 0.3064i 



at a distance of 0.6610 has one eigenvalue (A*) 1 = — 0. 0885 + 0. 9547i in the left-half plane and 
the other (A*) 2 = — 0.9547i on the imaginary axis. The e-pseudospectrum of A is depicted 
in Figure [3] For e = 0.6610, one component of the e-pseudospectrum crosses the imaginary 
axis, while the other component touches the imaginary axis. 



9 Concluding Remarks 

In this work a singular value characterization has been derived for the 2-norm of a smallest 
perturbation to a square or a rectangular pencil A — XB such that the perturbed pencil has 
a desired set of eigenvalues. The immediate corollaries of this main result are 

(i) a singular value characterization for the 2-norm of the smallest perturbation so that 
the perturbed pencil has a specified number of its eigenvalues in a desired region in the 
complex plane, and 
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(ii) a singular value characterization for the 2-norm of the smallest perturbation to a rect- 
angular pencil so that it has a specified number of eigenvalues. 

Partly motivated by an application explained in the introduction, we allow perturbations 
to A only. The extension of our results to the case of simultaneously perturbed A and B 
remains open. 

The development of efficient and reliable computational techniques for the solution of the 
derived singular value optimization problems is still in progress. As of now the optimiza- 
tion problems can be solved numerically only for small pencils with small number of desired 
eigenvalues. The main task that needs to be addressed from a computational point of view 
is a reliable and efficient implementation of the DIRECT algorithm for Lipschitz-based opti- 
mization. For large pencils it is necessary to develop Lipschitz-based algorithms converging 
asymptotically faster than the algorithms (such as the DIRECT algorithm) stemming from 
the Piyavskii-Shubert algorithm. The derivatives from Section |4.1| might constitute a first 
step in this direction. 
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Proof that o 



mr- 



_i T, i,B))^0asr^oo 



We prove that the r smallest singular values of C(/J,, T, A, B) decay to zero as soon as at least 
one entry of V tends to infinity, provided that n = to. In the rectangular case, n > to, these 
singular values generally do not decay to zero. 

We start by additionally assuming that A — ^B are non-singular matrices for all i = 
1, . . . , r. We will first prove the result under this assumption, and then we will drop it. Our 
approach is a generalization of the procedure from |161 §5], which in turn is a generalization 
of HI Lemma 2]. 

Under our assumptions the matrix C(fj,,T,A, B) is non-singular, and one can explicitly 
calculate the inverse. It is easy to see that the matrix £ _1 (/^, V, A, B) has the form 



(A-f^B)- 1 

A 21 
A31 





(A-^B)- 1 
A3 2 







(A-firB)- 1 



X r i A,. 2 

We will use the well-known relations 

<y nr -r+i (£(/*> r, A, B)) = o r (£(n, T, A, B)- 1 )' 1 < u r (A,,)" 1 . 



(28) 



We first compute the matrices A21, . . . , A rjI ._i which lie on the first sub-diagonal. By a 
straightforward computation we obtain 

A i+M = -7 i+ i,i(A - fii+tB^BiA - fiiB)- 1 . 
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If cr r ((A - n i+ iB)- l B(A - ViB)- 1 ) > 0, then fr om ( [28] ) it follows that if any of |7i+i,i| tends 
to infinity, we obtain the desired result. But o r [(A — [j,i+iB)~ 1 B(A — /i^i?) -1 ) > easily 
follows from the assumption rank(i?) > r. 

If this is not the case, meaning maxj^+i^} is bounded, then we use the entries on the 
next sub-diagonal X i+2 .i- Again by straightforward computation we obtain 

Because again rank(B) > r implies ay ((A — fii + 2B)~ 1 B(A — /ijS) -1 ) > 0, it follows that if 
any of |7i+2,i| tend to infinity, we obtain the desired result. In general, we have the recursive 
formula 

j-i 

X i+ j ti = -ji+jjiA - u. i+J By 1 B(A - /Zj.B) _1 - }^ i+jti+k (A - /i 1+J B) _1 M 1+i>I . 

k=l 

Applying the same procedure as above, we conclude the proof in this case. 

To remove the assumption that the matrices A — fiiB are non-singular, we fix any e > 0. 
Let us choose a matrix A e such that \\A e — A\\ < e and that the matrices A e — fiiB are non- 
singular for all i — 1, . . . ,r. From the arguments above, if follows that there exists 70 > 
such that cr„ r _ r +i T, A e , B)) < e, when ||r|| > 70. Since 

Cw-r+i (£(p, r, A, B)) < (£(fi,r,A e ,B))+e, 

we obtain the inequality cr nr _ r+ i T, A, B)) < 2e, when ||r|| > 70- 
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